[UR][Benchmarks] Add flamegraphs to benchmark results #19678

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Merged

uditagarwal97 merged 10 commits into intel:sycl from mateuszpn:flamegraphs

Sep 2, 2025

Contributor

mateuszpn commented Aug 1, 2025 •

edited

Loading

Adds presentation of perf results as flamegraphs
Run main.py with:
--flamegraph [exclusive] to create flamegraphs
--flamegraph force to create flamegraphs also for benchmarks marked as non-traceable
--flamegraph inclusive to create flamegraphs along with regular benchmarks
(the same options work for unitrace logs, with --unitrace)
Number of internal iterations in ComputeRuntime benchmark is reduced when generatin traces/flamegraphs

mateuszpn added 2 commits

July 31, 2025 15:43


          extend data.js with necessary variables

261c837


          Add flamegraphs to benchmarks

a51c767

Signed-off-by: Mateusz P. Nowak <[email protected]>

mateuszpn changed the title ~~[UR][Benchmarks]~~ [UR][Benchmarks] Add flamegraphs to benchmark reslts

mateuszpn temporarily deployed to WindowsCILock

August 4, 2025 13:58

— with

GitHub Actions Inactive

mateuszpn changed the title ~~[UR][Benchmarks] Add flamegraphs to benchmark reslts~~ [UR][Benchmarks] Add flamegraphs to benchmark results

mateuszpn marked this pull request as ready for review

August 4, 2025 13:59

mateuszpn requested a review from a team as a code owner

August 4, 2025 13:59

mateuszpn temporarily deployed to WindowsCILock

August 4, 2025 14:19

— with

GitHub Actions Inactive

mateuszpn temporarily deployed to WindowsCILock

August 4, 2025 14:19

— with

GitHub Actions Inactive

mateuszpn force-pushed the flamegraphs branch from dfbfad0 to e4dd31b Compare

August 4, 2025 14:31

mateuszpn requested a review from a team as a code owner

August 4, 2025 14:31

mateuszpn had a problem deploying to WindowsCILock

August 4, 2025 14:31

— with

GitHub Actions Error

mateuszpn force-pushed the flamegraphs branch from e4dd31b to e0390c8 Compare

August 4, 2025 14:32

mateuszpn temporarily deployed to WindowsCILock

August 4, 2025 14:32

— with

GitHub Actions Inactive

mateuszpn temporarily deployed to WindowsCILock

August 4, 2025 15:07

— with

GitHub Actions Inactive

mateuszpn temporarily deployed to WindowsCILock

August 4, 2025 15:07

— with

GitHub Actions Inactive


          Merge remote-tracking branch 'upstream/sycl' into flamegraphs

e0390c8

Signed-off-by: Mateusz P. Nowak <[email protected]>

pbalcer reviewed

View reviewed changes

devops/scripts/benchmarks/benches/base.py Outdated

Comment on lines 128 to 131

    
                      run_unitrace=False,

                      extra_unitrace_opt=None,

                      run_flamegraph=False,

                      extra_perf_opt=None,  # VERIFY

Contributor

pbalcer Aug 6, 2025

You already added a tracing type enum. I'd extend this to be some sort of generic "TraceTool", and here, in run_bench, I suggest simply accepting a generic trace tool (I imagine you wouldn't want to enable two at the same time).

devops/scripts/benchmarks/output_html.py Outdated

    
                      data_path = os.path.join(html_path, f"{filename}.js")

                      # Check if the file exists and has flamegraph data that we need to preserve

                      existing_flamegraph_data = None

Contributor

pbalcer Aug 6, 2025

why do we need to do this? we store all the other results separately from the html output.

pbalcer reviewed

View reviewed changes

devops/scripts/benchmarks/utils/flamegraph.py Outdated

    
                          options.workdir,

                          "flamegraph-repo",

                          "https://github.com/brendangregg/FlameGraph.git",

                          "master",

Contributor

pbalcer Aug 6, 2025

don't clone master, use a fixed commit.

devops/scripts/benchmarks/utils/flamegraph.py Outdated

    
                          "master",

                      )

                      # FlameGraph doesn't need building, just verify scripts exist and are executable

Contributor

pbalcer Aug 6, 2025

We don't check this anywhere else. I'm not sure if this would ever find an issue?

devops/scripts/benchmarks/utils/flamegraph.py Outdated

    
                          )

                  def _prune_flamegraph_dirs(self, res_dir: str, FILECNT: int = 10):

                      """Keep only the last FILECNT files in the flamegraphs directory."""

Contributor

pbalcer Aug 6, 2025

this seems similar to what you have for unitrace. can we share code?

devops/scripts/benchmarks/utils/flamegraph.py Outdated

    
                              "record",

                              "-g",  # Enable call-graph recording

                              "-F",

                              "99",  # Sample frequency

Contributor

pbalcer Aug 6, 2025

this seems low. we should experiment with different values and pick what gives us the best flamecharts.

devops/scripts/benchmarks/utils/flamegraph.py Outdated

Comment on lines 142 to 227

    
                  def handle_output(self, bench_name: str, perf_data_file: str):

                      """

                      Generate SVG flamegraph from perf data file.

                      Returns the path to the generated SVG file.

                      """

                      if not os.path.exists(perf_data_file) or os.path.getsize(perf_data_file) == 0:

                          raise FileNotFoundError(

                              f"Perf data file not found or empty: {perf_data_file}"

                          )

                      # Generate output SVG filename following same pattern as perf data

                      svg_file = perf_data_file.replace(".perf.data", ".svg")

                      folded_file = perf_data_file.replace(".perf.data", ".folded")

                      try:

                          # Step 1: Convert perf script to folded format

                          log.debug(f"Converting perf data to folded format: {folded_file}")

                          with open(folded_file, "w") as f_folded:

                              # Run perf script to get the stack traces

                              perf_script_proc = subprocess.Popen(

                                  ["perf", "script", "-i", perf_data_file],

                                  stdout=subprocess.PIPE,

                                  stderr=subprocess.DEVNULL,

                                  text=True,

                              )

                              # Pipe through stackcollapse-perf.pl

                              stackcollapse_perf_path = os.path.join(

                                  self.repo_dir, "stackcollapse-perf.pl"

                              )

                              stackcollapse_proc = subprocess.Popen(

                                  [stackcollapse_perf_path],

                                  stdin=perf_script_proc.stdout,

                                  stdout=f_folded,

                                  stderr=subprocess.DEVNULL,

                                  text=True,

                              )

                              perf_script_proc.stdout.close()

                              stackcollapse_proc.wait()

                              perf_script_proc.wait()

                          # Step 2: Generate flamegraph SVG

                          log.debug(f"Generating flamegraph SVG: {svg_file}")

                          flamegraph_pl_path = os.path.join(self.repo_dir, "flamegraph.pl")

                          with open(folded_file, "r") as f_folded, open(svg_file, "w") as f_svg:

                              flamegraph_proc = subprocess.Popen(

                                  [

                                      flamegraph_pl_path,

                                      "--title",

                                      f"{options.save_name} - {bench_name}",

                                      "--width",

                                      str(

                                          self.FLAMEGRAPH_WIDTH

                                      ),  # Fit within container without scrollbars

                                  ],

                                  stdin=f_folded,

                                  stdout=f_svg,

                                  stderr=subprocess.DEVNULL,

                                  text=True,

                              )

                              flamegraph_proc.wait()

                          # Clean up intermediate files

                          if os.path.exists(folded_file):

                              os.remove(folded_file)

                          if not os.path.exists(svg_file) or os.path.getsize(svg_file) == 0:

                              raise RuntimeError(f"Failed to generate flamegraph SVG: {svg_file}")

                          log.debug(f"Generated flamegraph: {svg_file}")

                          # Create symlink immediately after SVG generation

                          self._create_immediate_symlink(svg_file)

                          # Prune old flamegraph directories

                          self._prune_flamegraph_dirs(os.path.dirname(perf_data_file))

                          return svg_file

                      except Exception as e:

                          # Clean up on failure

                          for temp_file in [folded_file, svg_file]:

                              if os.path.exists(temp_file):

                                  os.remove(temp_file)

                          raise RuntimeError(f"Failed to generate flamegraph for {bench_name}: {e}")

Contributor

pbalcer Aug 6, 2025 •

edited

Loading

use run helpers... I suggest expanding run to support redirecting stdout to a file (and then not printing it to the console).

devops/scripts/benchmarks/utils/flamegraph.py Outdated

    
                      except Exception as e:

                          log.debug(f"Failed to create immediate symlink for {svg_file}: {e}")

                  def _update_flamegraph_manifest(

Contributor

pbalcer Aug 6, 2025

I genuinely don't understand the idea here. data.js is purely an output file. we should not parse it.


          change use of data.js

0987a08

PatKamin reviewed

View reviewed changes

devops/scripts/benchmarks/benches/base.py Outdated Show resolved Hide resolved

PatKamin reviewed

View reviewed changes

devops/scripts/benchmarks/options.py Outdated Show resolved Hide resolved


          significantly rebuild

86f36f7

Signed-off-by: Mateusz P. Nowak <[email protected]>

mateuszpn force-pushed the flamegraphs branch from 9ec5c0c to 2f85291 Compare

August 26, 2025 15:09


          add force option, omitting traceable()

2f85291

Signed-off-by: Mateusz P. Nowak <[email protected]>

mateuszpn temporarily deployed to WindowsCILock

August 27, 2025 11:37

— with

GitHub Actions Inactive

mateuszpn marked this pull request as draft

August 27, 2025 11:44

mateuszpn temporarily deployed to WindowsCILock

August 27, 2025 11:58

— with

GitHub Actions Inactive

mateuszpn temporarily deployed to WindowsCILock

August 27, 2025 11:58

— with

GitHub Actions Inactive


          Reduce internal iterations in ComputeBenchmarks

3bbc26d

Signed-off-by: Mateusz P. Nowak <[email protected]>

mateuszpn temporarily deployed to WindowsCILock

August 27, 2025 12:46

— with

GitHub Actions Inactive

mateuszpn temporarily deployed to WindowsCILock

August 27, 2025 13:07

— with

GitHub Actions Inactive

mateuszpn temporarily deployed to WindowsCILock

August 27, 2025 13:07

— with

GitHub Actions Inactive


          Merge remote-tracking branch 'upstream/sycl' into flamegraphs

2596e29

Signed-off-by: Mateusz P. Nowak <[email protected]>

mateuszpn temporarily deployed to WindowsCILock

August 27, 2025 15:16

— with

GitHub Actions Inactive

mateuszpn marked this pull request as ready for review

August 27, 2025 15:17

mateuszpn requested review from PatKamin and pbalcer

August 27, 2025 15:17

mateuszpn temporarily deployed to WindowsCILock

August 27, 2025 16:01

— with

GitHub Actions Inactive

mateuszpn had a problem deploying to WindowsCILock

August 27, 2025 16:01

— with

GitHub Actions Failure


          frequency update

Signed-off-by: Mateusz P. Nowak <[email protected]>

PatKamin reviewed

View reviewed changes

devops/scripts/benchmarks/benches/base.py Outdated Show resolved Hide resolved

devops/scripts/benchmarks/benches/compute.py Outdated Show resolved Hide resolved

PatKamin reviewed

View reviewed changes

devops/scripts/benchmarks/benches/compute.py Outdated Show resolved Hide resolved

PatKamin reviewed

View reviewed changes

devops/scripts/benchmarks/benches/compute.py Outdated Show resolved Hide resolved

PatKamin reviewed

View reviewed changes

devops/scripts/benchmarks/main.py Outdated Show resolved Hide resolved

devops/scripts/benchmarks/main.py Outdated Show resolved Hide resolved

PatKamin reviewed

View reviewed changes

devops/scripts/benchmarks/options.py Show resolved Hide resolved

mateuszpn temporarily deployed to WindowsCILock

August 28, 2025 12:39

— with

GitHub Actions Inactive

mateuszpn temporarily deployed to WindowsCILock

August 28, 2025 13:21

— with

GitHub Actions Inactive

mateuszpn temporarily deployed to WindowsCILock

August 28, 2025 13:44

— with

GitHub Actions Inactive

mateuszpn had a problem deploying to WindowsCILock

August 28, 2025 13:44

— with

GitHub Actions Failure

mateuszpn force-pushed the flamegraphs branch from 37e81a5 to d51734c Compare

September 2, 2025 11:10

mateuszpn temporarily deployed to WindowsCILock

September 2, 2025 11:10

— with

GitHub Actions Inactive

mateuszpn requested a review from PatKamin

September 2, 2025 11:11

mateuszpn temporarily deployed to WindowsCILock

September 2, 2025 11:33

— with

GitHub Actions Inactive

mateuszpn temporarily deployed to WindowsCILock

September 2, 2025 11:33

— with

GitHub Actions Inactive

PatKamin approved these changes

View reviewed changes


          apply comments

d51734c

Signed-off-by: Mateusz P. Nowak <[email protected]>

Contributor Author

mateuszpn commented Sep 2, 2025

@intel/llvm-gatekeepers This is approved by owner and ready to merge

uditagarwal97 merged commit 48e397c into intel:sycl

27 checks passed

mateuszpn deleted the flamegraphs branch

October 22, 2025 12:13

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet